Design and Scaffolded Training of an Efficient DNN Operator for Computer Vision on the Edge
نویسندگان
چکیده
Massively parallel systolic arrays and resource-efficient depthwise separable convolutions are two promising hardware software techniques to accelerate DNN inference on the edge. Interestingly, their combination is inefficient: Computational patterns of do not exhibit a rhythmic flow lack sufficient data reuse saturate arrays. In this article, we formally analyse inefficiency propose an efficient operator, optimal dataflow, superior training methodology towards alleviating this. The called Fully-Separable Convolutions (FuSeConv) , 1 drop-in replacement for depthwise-separable convolutions. FuSeConv generalizes factorization convolution fully along spatial depth dimensions. resultant computation efficiently maps Spatial-Tiled Output Stationary (ST-OS) maximizes efficiency It independent rows array maximise resource-utilization with negligible VLSI overheads. Neural Operator Scaffolding (NOS) scaffolds operators by distilling knowledge from more expensive operation. This bridges accuracy gap between networks Additionally, NOS can be combined Architecture Search (NAS) trade off latency accuracy. hardware-software co-design ST-OS achieves significant speedup 4.1-9.25× state-of-the-art ImageNet dataset. parameter its superiority over illustrates promise as strong solution Training comparable baselines. Further, combining NAS, design that define models improving both computer vision
منابع مشابه
nano-rods zno as an efficient catalyst for the synthesis of chromene phosphonates, direct amidation and formylation of amines
چکیده ندارد.
an investigation of the impact of self monitoring on langauge teachers motivational practice and its effect on learners motivation
the central purpose of this study was to conduct a case study about the role of self monitoring in teacher’s use of motivational strategies. furthermore it focused on how these strategies affected students’ motivational behavior. although many studies have been done to investigate teachers’ motivational strategies use (cheng & d?rnyei, 2007; d?rnyei & csizer, 1998; green, 2001, guilloteaux & d?...
a study on the design of bio-ethanol process from date wastes of sistan and baluchistan province
اتانول کاربردهای متنوعی در صنایع لاستیک سازی، رنگسازی، حلالها ومکمل سوخت خودرو دارد. اتانول برخلاف نفت از جمله مواد تجدیدپذیر محسوب می شود که مشکلات زیست محیطی و آلودگی نیز ایجاد نمی کند. استفاده از اتانول به عنوان مکمل سوختخودروها از جمله مهمترین مصارف صنعتی این ماده بشمار می رود. با توجه به این موضوع تحقیق و توسعه در زمینه تولید اتانول با درجه خلوص بالا در سطح جهان، و نه تنها در کشور های پیشر...
investigating the feasibility of a proposed model for geometric design of deployable arch structures
deployable scissor type structures are composed of the so-called scissor-like elements (sles), which are connected to each other at an intermediate point through a pivotal connection and allow them to be folded into a compact bundle for storage or transport. several sles are connected to each other in order to form units with regular polygonal plan views. the sides and radii of the polygons are...
development and implementation of an optimized control strategy for induction machine in an electric vehicle
in the area of automotive engineering there is a tendency to more electrification of power train. in this work control of an induction machine for the application of electric vehicle is investigated. through the changing operating point of the machine, adapting the rotor magnetization current seems to be useful to increase the machines efficiency. in the literature there are many approaches wh...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions in Embedded Computing Systems
سال: 2022
ISSN: ['1539-9087', '1558-3465']
DOI: https://doi.org/10.1145/3511212